NSF PAR Search | NSF Public Access Repository

PropBank Comes of Age—Larger, Smarter, and more Diverse

https://doi.org/10.18653/v1/2022.starsem-1.24

Pradhan, Sameer; Bonn, Julia; Myers, Skatje; Conger, Kathryn; O’Gorman, Tim; Gung, James; Wright-Bettner, Kristin; Palmer, Martha (July 2022, Proceedings of the 11th Joint Conference on Lexical and Computational Semantics)
Vivi Nastase; Ellie Pavlick; Mohammad Taher Pilehvar; Jose Camacho-Collados; Alessandro Raganato (Ed.)
This paper describes the evolution of the PropBank approach to semantic role labeling over the last two decades. During this time the PropBank frame files have been expanded to include non-verbal predicates such as adjectives, prepositions and multi-word expressions. The number of domains, genres and languages that have been PropBanked has also expanded greatly, creating an opportunity for much more challenging and robust testing of the generalization capabilities of PropBank semantic role labeling systems. We also describe the substantial effort that has gone into ensuring the consistency and reliability of the various annotated datasets and resources, to better support the training and evaluation of such systems.
Full Text Available
An Instance Level Approach for Shallow Semantic Parsing in Scientific Procedural Text

https://doi.org/10.18653/v1/2020.findings-emnlp.270

Swarup, Daivik; Bajaj, Ahsaas; Mysore, Sheshera; O’Gorman, Tim; Das, Rajarshi; McCallum, Andrew (November 2020, Findings of the Association for Computational Linguistics: EMNLP 2020)
In specific domains, such as procedural scientific text, human-labeled data for shallow semantic parsing is especially limited and expensive to create. Fortunately, such specific domains often use rather formulaic writing, such that the different ways of expressing relations in a small number of grammatically similar labeled sentences may provide high coverage of semantic structures in the corpus, through an appropriately rich similarity metric. In light of this opportunity, this paper explores an instance-based approach to the relation prediction sub-task within shallow semantic parsing, in which semantic labels from structurally similar sentences in the training set are copied to test sentences. Candidate similar sentences are retrieved using SciBERT embeddings. For labels where it is possible to copy from a similar sentence, we employ an instance-level copy network; when this is not possible, a globally shared parametric model is employed. Experiments show our approach outperforms both baseline and prior methods by 0.75 to 3 F1 absolute in the Wet Lab Protocol Corpus and 1 F1 absolute in the Materials Science Procedural Text Corpus.
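The copy-or-fall-back idea in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the toy vectors stand in for SciBERT embeddings, the fixed similarity `threshold` stands in for the learned copy network's decision, and `predict_label`, `fallback`, and the label names are hypothetical.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def predict_label(query_vec, train_items, fallback, threshold=0.8):
    """Copy the label of the most similar training instance when it is
    similar enough; otherwise defer to a shared fallback model.

    train_items: list of (vector, label) pairs (toy stand-ins for embeddings).
    fallback: callable taking the query vector (stand-in for a parametric model).
    threshold: assumed cut-off; the paper uses a learned copy network instead.
    """
    best_sim, best_label = -1.0, None
    for vec, label in train_items:
        sim = cosine(query_vec, vec)
        if sim > best_sim:
            best_sim, best_label = sim, label
    if best_label is not None and best_sim >= threshold:
        return best_label, "copied"
    return fallback(query_vec), "parametric"


# Hypothetical labelled instances and a trivial fallback model.
train = [([1.0, 0.0], "REL_A"), ([0.0, 1.0], "REL_B")]
print(predict_label([0.9, 0.1], train, lambda v: "REL_OTHER"))
print(predict_label([1.0, 1.0], train, lambda v: "REL_OTHER"))
```

In this toy setup the first query is close to a labelled instance, so its label is copied; the second is equally far from both instances and falls through to the fallback.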
Full Text Available
Unsupervised Parsing with S-DIORA: Single Tree Encoding for Deep Inside-Outside Recursive Autoencoders

https://doi.org/10.18653/v1/2020.emnlp-main.392

Drozdov, Andrew; Rongali, Subendhu; Chen, Yi-Pei; O’Gorman, Tim; Iyyer, Mohit; McCallum, Andrew (November 2020, Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP))
The deep inside-outside recursive autoencoder (DIORA; Drozdov et al. 2019) is a self-supervised neural model that learns to induce syntactic tree structures for input sentences *without access to labeled training data*. In this paper, we discover that while DIORA exhaustively encodes all possible binary trees of a sentence with a soft dynamic program, its vector averaging approach is locally greedy and cannot recover from errors when computing the highest scoring parse tree in bottom-up chart parsing. To fix this issue, we introduce S-DIORA, an improved variant of DIORA that encodes a single tree rather than a softly-weighted mixture of trees by employing a hard argmax operation and a beam at each cell in the chart. Our experiments show that through *fine-tuning* a pre-trained DIORA with our new algorithm, we improve the state of the art in *unsupervised* constituency parsing on the English WSJ Penn Treebank by 2.2-6% F1, depending on the data used for fine-tuning.
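As a rough illustration of what a hard argmax at each chart cell means, the sketch below fills a bottom-up chart and keeps only the single best split per span (a beam of size 1). It is not S-DIORA itself: the real model composes learned vectors and scores them, whereas `span_score` here is a hypothetical, hand-supplied scoring function, and `best_tree` is an illustrative name.

```python
def best_tree(words, span_score):
    """Bottom-up chart parse that keeps one best binary tree per span.

    words: list of tokens.
    span_score: callable (i, j) -> float scoring the span words[i:j]
                (an assumed stand-in for a learned scorer).
    Returns (score, tree), where tree is a nested tuple of tokens.
    """
    n = len(words)
    best = {}  # (i, j) -> (score, tree) for the span words[i:j]
    for i in range(n):
        best[(i, i + 1)] = (0.0, words[i])
    for length in range(2, n + 1):
        for i in range(0, n - length + 1):
            j = i + length
            candidates = []
            for k in range(i + 1, j):  # every split point of the span
                left_score, left_tree = best[(i, k)]
                right_score, right_tree = best[(k, j)]
                total = left_score + right_score + span_score(i, j)
                candidates.append((total, (left_tree, right_tree)))
            # Hard argmax: keep one tree rather than a weighted mixture.
            best[(i, j)] = max(candidates, key=lambda c: c[0])
    return best[(0, n)]


# Hypothetical span scores for a three-word sentence.
scores = {(0, 2): 1.0, (1, 3): 0.2, (0, 3): 0.5}
score, tree = best_tree(["the", "cat", "sat"], lambda i, j: scores.get((i, j), 0.0))
print(score, tree)
```

A larger beam would keep the top few candidates per cell instead of one, which is the mechanism the abstract describes for recovering from locally greedy choices.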
Full Text Available
Theoretical and Practical Issues in the Semantic Annotation of Four Indigenous Languages

https://doi.org/10.18653/v1/2021.law-1.2

Van Gysel, Jens E.; Vigus, Meagan; Denk, Lukas; Cowell, Andrew; Vallejos, Rosa; O’Gorman, Tim; Croft, William (January 2021, Proceedings of the Joint 15th Linguistic Annotation Workshop (LAW) and 3rd Designing Meaning Representations (DMR) Workshop)

Full Text Available
Designing a Uniform Meaning Representation for Natural Language Processing

https://doi.org/10.1007/s13218-021-00722-w

Van Gysel, Jens E.; Vigus, Meagan; Chun, Jayeol; Lai, Kenneth; Moeller, Sarah; Yao, Jiarui; O’Gorman, Tim; Cowell, Andrew; Croft, William; Huang, Chu-Ren; et al. (April 2021, KI - Künstliche Intelligenz)
In this paper we present Uniform Meaning Representation (UMR), a meaning representation designed to annotate the semantic content of a text. UMR is primarily based on Abstract Meaning Representation (AMR), an annotation framework initially designed for English, but also draws from other meaning representations. UMR extends AMR to other languages, particularly morphologically complex, low-resource languages. UMR also adds features to AMR that are critical to semantic interpretation and enhances AMR by proposing a companion document-level representation that captures linguistic phenomena such as coreference as well as temporal and modal dependencies that potentially go beyond sentence boundaries.
Full Text Available
